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Abstract 

The complex, tetraploid genome structure of peanut (Arachis hypogaea) has obstructed advances in gen- 
etics and genomics in the species. The aim of this study is to understand the genome structure of Arachis by 
developing a high-density integrated consensus map. Three recombinant inbred line populations derived 
from crosses between the A genome diploid species, Arachis duranensis and Arachis stenosperma; the B 
genome diploid species, Arachis ipaensis and Arachis magna; and between the AB genome tetraploids, 
A. hypogaea and an artificial amphidiploid (A. ipaensis x A. duranensis)** , were used to construct genetic 
linkage maps: 10 linkage groups (LGs) of 544 cM with 597 loci for the A genome; 10 LGs of 461 cM with 
798 loci for the B genome; and 20 LGs of 1 442 cM with 1 469 loci for the AB genome. The resultant maps 
plus 13 published maps were integrated into a consensus map covering 2651 cM with 3693 marker loci 
which was anchored to 20 consensus LGs corresponding to the A and B genomes. The comparative genomics 
with genome sequences of Cajanus cajan, Glycine max, Lotus japonicus, and Medicago truncatula revealed 
that the Arachis genome has segmented synteny relationship to the other legumes. The comparative maps 
in legumes, integrated tetraploid consensus maps, and genome-specific diploid maps will increase the 
genetic and genomic understanding of Arachis and should facilitate molecular breeding. 
Key words: Arachis spp.; comparative genomics; genetic linkage map; integrated consensus map; legume 
genome 



1 . Introduction 

Fabaceae, or Leguminosae, is composed of ~700 
genera and 2 0 000 species. 1 The family is classified 
into three major subfamilies Mimosoideae, Caesal- 
pinioideae, and Papilionoideae. 2 Papilionoideae is 



divided into four clades Genistoids, Dalbergioids, 
Phaseoloids, and Galegoids. Most tropical and cool 
season legumes used as crops and studied as model 
plants for symbiosis of rhizobium are members of the 
Phaseoloids, e.g. genera Cajanus, Glycine, Phaseolus, 
and Vigna, or the Galegoids, e.g. genera Lotus, 
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Medicago, Pisum, Trifolium, and Vicia. 2 Therefore, 
legume genetics and genomics, including genome 
sequencing and comparative genomics, have greatly 
advanced in the Phaseoloids and Galegoids. 3-9 

The Dalbergioids, which include the genus, Arachis, 
are more basal in their divergence within the 
Papilionoideae than the Phaseoloids and Galegoids. 
The genus Arachis comprises 80 species (2n = 2x = 
18, 2n = 2x=20, and 2n = 4x = 40) possessing A, 
B, C, D, E, F, K, R, and T genomes. 1 0-1 2 As regarding 
the A genome diploids consisting of 1 5 wild species, 
they are morphologically similar, cross compatible, 
produce fertile hybrids with near normal chromo- 
some pairing, and have a common karyotype struc- 
ture. 13 The B-genome diploids also have similar 
morphological and chromosome feature, cross- 
compatibility, bivalent formation at meiosis in inter- 
specific hybrids, and the same karyotype structure. 12 
Peanut (Arachis hypogaea), or groundnut, is an autog- 
amous allotetraploid legume (2n = 4x = 40) har- 
bouring homoeologous A and B genomes that are 
derived from two diploids, most likely /4rac/7/s duranen- 
sis (A genome) and Arachis ipaensis (B genome). 14,15 
While peanut is an important source of food and oil, 
the genetics and genomics of this legume lag far 
behind those of the Phaseoloids and Galegoids due 
to a complex genome structure with challenging fea- 
tures, such as tetraploidy, low genetic diversity, and a 
relatively large genome size of 2.8 Gb. 16 

Molecular genetic studies of Arachis initially pro- 
gressed by using diploid rather than tetraploid peanut 
due to the greater simplicity of diploids as genetic 
models. 17 The first genetic linkage map in genus 
Arachis was developed based on an interspecific cross- 
ing between diploids Arachis stenosperma (A genome) 
and Arachis cardenasii (also A genome), 17 followed 
by a cross between A. duranensis and A. stenosperma 
(both A genomes) 1 8,1 9 and between A. ipaensis and 
Arachis magna (both B genomes). 20 However, because 
of the small number of available DNA markers at the 
time, the number of mapped marker loci was limited 
to between 117 and 369. Recently, intraspecific 
maps for A-genome diploid (A. duranensis) and B- 
genome diploid (Arachi batizocoi) have been developed 
with 1 724 and 449 marker loci, respectively, including 
single nucleotide polymorphisms and expressed 
sequence tag-simple sequence repeats (EST-SSRs) 
markers. 21,22 

Meanwhile, an artificial amphidiploid derived from 
a three-way cross between A. batizocoi (B genome) 
and a hybrid between A. cardenasii (A genome) 
and A. diogoi (A genome) has been crossed with 
A. hypogaea to introduce genetic diversity from wild 
diploid species into the tetraploid-cultivated peanut. 
The BCt progenies of the cross have contributed to 
the generation of a tetraploid genetic linkage map 



with 370 restriction-fragment length polymorph- 
isms. 23 In addition, another artificial amphidiploid 
line, derived from a hybrid between A. duranensis 
and A. ipaensis, was used to establish a genetic 
linkage map made up of 298 SSR marker loci. 24 In 
the latter map, the linkage groups (LGs) were firstly 
anchored to the A and B genomes by identifying the 
genome origins of the mapped loci in the LGs based 
on the corresponding sizes of the DNA amplified 
from the original diploids. In the tetraploid-cultivated 
peanut (A. hypogaea), the population-specific linkage 
maps comprise 318 marker loci. Due to low 
genetic diversity, 25-33 this is the maximum number 
of marker loci for population-specific linkage maps, 
although >6000 SSR markers have been developed 
for mapping. 34 By integrating the 1 1 linkage maps 
of the artificial amphidiploids and cultivated tetra- 
ploid peanuts, a reference consensus map consisting 
of 897 marker loci was constructed and annotated 
with consensus nomenclature for the LGs (a01 -a1 0 
and b01 -b1 0). 35 

Concurrent with the progress of the above genetic 
studies in Arachis spp., a high-density linkage map of 
cultivated peanut covering 2166cM with 1114 
marker loci has also been reported. 36 Because most 
of the markers on this map were published after the 
development of the other Arachis maps, 36-38 the 
markers do not appear on them. 1 7-20,23-32 In add- 
ition, due to an insufficient number of commonly 
mapped loci in the published diploid maps, the LGs 
have not been assigned to either A or B genome 
types, even though homoeologous LGs have been 
identified. 36 Anchoring the genetic LGs to the A and 
B genomes will enable enrichment of marker 
density on the consensus and diploid maps. 19,20,35 
This would contribute to a better understanding of 
genome evolution within Arachis and among the 
Fabaceae. In addition, it would serve to increase the 
efficiency of introgression of desirable alleles from 
diploid wild species into the tetraploid-cultivated 
peanut through marker-assisted selection. 

Comparative analysis of the A genome diploid 
Arachis species, e.g. A. duranensis and A. stenosperma, 
and model legumes has previously been performed. 
This comparative analysis revealed candidate synteny 
blocks between the genomes of Lotus japonicus, 
Medicago truncatula, and Glycine max. 21,39 To 
date, no comparative analysis of cultivated peanut 
and other legume species has been reported. 
Comparison of the tetraploid Arachis genome, repre- 
sented by the high-density consensus map of the 
genomes of legumes Cajanus cajan, G. max, L. japoni- 
cus, and A/I. truncatula, would provide further insights 
into the legume genomes. 

To address the above issues, it was considered that 
high-density linkage maps of the tetraploid and 
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diploid species, on which common makers were 
mapped, would be required. Therefore, we tried to 
map the large number of DNA markers reported for 
the two cultivated peanut maps published by 
Shirasawa et al. 36 onto the previously reported 
linkage maps of the A and B genomes, 19,20 along 
with that of an artificial amphidiploid derived from 
a hybrid of diploid species. The established linkage 
maps were integrated with 1 3 reported tetraploid 
Arachis maps to increase the number of mapped loci 
on the previously published consensus map. 35 
Subsequently, the maps were subjected to compara- 
tive analysis with four legume genomes, C. cajan, 
G. max, L. japonicus, and M. truncatula, to clarify 
features of the genome structure of the genus Arachis. 

2. Materials and methods 

2.1. Plant materials 

In previous studies, two F 2 mapping populations, 
derived from interspecific crosses between the two A 
genome diploid species, A. duranensis 'K7988' and 
A. stenosperma 'V10309', and between the two B 
genome diploid species, A. ipaensis 'K30076' and 
A. magna 'K30097', were used to construct genetic 
linkage maps. 19,20 In the present study, the F 5 and 
F 6 progenies of the diploid A and B genome 
mapping populations, respectively, were generated 
by single seed descent, and used as recombinant 
inbred mapping populations. The A and B genome 
mapping populations were named as AF5 (« = 89) 
and BF6 (n = 94), respectively. In addition, other 
recombinant inbred lines were used to construct AB 
genome tetraploid (n = 91: population TF6) maps. 
This population consisted of F 6 lines derived from a 
cross between A. hypogaea 'Runner IAC 886' and an 
artificial amphidiploid (A. ipaensis x A. duranensis) 4 *, 
which was developed by hybridizing A. ipaensis 
'K30076' and A. duranensis VI 41 67'. Genomic DNA 
from each line was extracted from young leaflets es- 
sentially as described by Grattapaglia and Sederoff. 40 

2.2. Polymorphism analysis with DNA markers 

A total of 3902 DNA markers comprising 1894 
genomic SSRs A. hypogaea genomic SSR (AHGS), 36 
1571 EST-SSRs A. hypogaea EST-SSR (AHS), 37 and 
43 7 transposon-based markers A. hypogaea transpos- 
able element (AhTE) 36,38 were screened against the 
parental lines of the populations AF5, BF6, and TF6 
using an ABI-3730x/ fluorescent fragment analyser 
(Applied Biosystems). Polymorphic markers were 
analysed on the three mapping populations using 
10% polyacrylamide gels for markers exhibiting 
> 1 0-bp allele size differences, or the fluorescent frag- 
ment analyser for markers exhibiting <1 0-bp allele 



size differences between the parental lines. PCR, elec- 
trophoresis, and data scoring were performed as 
described previously. 36-38 

2.3. Construction of linkage maps and the integrated 
map 

Linkage analysis of segregated genotypic data 
obtained in this study was performed together with 
linkage analysis of the previously investigated 
genotypes with mapped markers on the diploid 
maps 19,20 using JoinMap® version 4. 41 The marker 
loci were roughly classified using the JoinMap® 
Grouping Module with logarithm of odds (LOD) 
scores of 4.0-10.0. The JoinMap® Combine Groups 
for Map Integration Module was used to integrate the 
linkage maps developed in this study and the 1 3 previ- 
ously published maps into a tetraploid map, i.e. the 
cultivated peanut map 25-32,36 and an artificial amphi- 
diploid map. 24 Markerorderand genetic distance were 
calculated using a regression mapping algorithm with 
the following parameters: Haldane's mappingfunction, 
recombination frequency <0.30, and LOD score >2.0. 
The graphical linkage maps were drawn using the 
MapChart program. 42 

2.4. Comparative analysis of the Arachis maps 
with other legume genomes 

BLASTN 43 was used to conduct similarity searches 
(using the nucleotide sequences from which the 
mapped DNA markers were designed) against each 
pseudomolecule of the genome sequences of 
C. cajan, 8 G. max (Glymal), 6 L, japonicus (build 
2. 5), 5 and M. truncatula (Mt3.5v4), 7 with a threshold 
£-value of 1e-20. The graphical comparative maps 
were drawn using the Circos program. 44 

3. Results 

3.1 . Polymorphism screening of the parental lines 
of the mapping populations and construction 
of the linkage maps 
Prior to polymorphism analysis between the paren- 
tal lines, a total of 3902 markers, i.e. 1894 AHGS, 
1571 AHS, and 43 7 AhTE markers, were pre-selected 
from the 6680 AHGS, 3187 AHS, and 1039 AhTE 
markers that have been published. 36-38 The 
markers were pre-selected according to the following 
criteria: (i) AHGS markers were selected if they were 
expected to detect polymorphism among the culti- 
vated peanut lines according to the results of 
in silico polymorphism analysis; 36 (ii) AHS markers 
were selected if they showed polymorphisms 
between cultivated lines and wild species; 37 (iii) 
AhTE markers were selected if they amplified single 
or double bands. 36,38 
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3.7.7. The AF5 population A total of 582 markers 
(14.9%= 582/3902), including 450 AHGS (23.8% = 
450/1 894), 1 09 AHS (6.9%= 1 09/1 571), and 23 
AhTE (5.3% = 23/437) markers, were selected as 
polymorphism candidates between the parental lines 
of the AF5 population, A. duranensis 'K7988' and 
A. stenosperma 'V1 0309'. Out of the 582 polymorphic 
markers, 39 5 generated a total of 437 segregation 
loci, since several markers detected more than one 
locus. Specifically, 359,31 ,4, and 1 marker(s) detected 
1 , 2, 3, and 4 polymorphic loci, respectively, which were 
suffixed with '_aT and '_a2', e.g. AHGS1 647_a1 and 
AHGS1 647_a2, to distinguish the loci each other. The 
437 polymorphic loci, 284 codominant, and 153 
dominant loci, were successfully mapped onto 1 0 LGs 
covering 544 cM, together with 1 60 previously 
mapped polymorphic loci from 158 markers (Fig. 1, 
Table 1 , and Supplementary Table S1 ). 1 9 The average 
markerdensityof the map was 0.9 cM,and segregation 
distortions were observed in 65% (390 loci) of the 
mapped loci (Table 1 ). 

3.1.2. The BF6 population A total of 862 markers 
(22.1%= 862/3902), including 51 3 AHGS (27.1% = 
51 3/1894), 279 AHS (1 7.8% = 279/1 571 ), and 70 
AhTE (1 6.0% = 70/437) markers, were selected as 
polymorphism candidates between the parental lines 
of the BF6 population, A. ipaensis 'K30076' and 
A. magna 'K30097'. Out of the 862 polymorphic 
markers, 624 generated a total of 680 segregation 
loci, since, as with the AF5 population, several of these 
markers detected more than one locus. Specifically, 
579, 3 5, 9, and 1 marker(s) detected 1, 2, 3, and 4 
polymorphic loci, respectively, which were suffixed 
with '_bT and '_b2', e.g. AHGS1478_b1 and 
AHGS1 478_b2, to distinguish the loci each other. The 
680 polymorphic loci, 549 codominant, and 131 
dominant, were successfully mapped onto 10 LGs 
covering 461 cM together with 118 previously 
mapped loci from 1 1 6 markers (Fig. 1 , Table 1 , and 
Supplementary Table S1). 20 The average marker 
density of the map was 0.6 cM, and segregation distor- 
tions were observed in 31 % (2 50 loci) of the mapped 
loci (Table 1). 

3.7.3. The TF6 population Together with the 
parental lines of TF6, the two original diploids of the 
artificial amphidiploid, A. ipaensis 'K30076' and 
A. duranensis 'V1 41 67', were also genotyped to iden- 
tify the genome origin (A or B genome) of the 
polymorphic DNA fragments. A total of 1144 
markers (29.3% = 1 1 44/3902), including 664 
AHGS (35.1%= 664/1 894), 257 AHS (16.4% = 
257/1 571), and 223 AhTE (51 .0% = 223/437) 
were selected as polymorphism candidates between 
the parental lines of the TF6 population, A. hypogaea 
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'Runner IAC 886' and an artificial amphidiploid 
(A. ipaensis 'K30076'xA. duranensis 'V14167') 4x . 
Out of the 1 144 polymorphic markers, 1 055 gener- 
ated a total of 1261 segregation loci, since, as with 
the AF5 and BF6 populations, several of these 
markers detected more than one locus. Specifically, 
879, 1 50, 22, and 4 markers detected 1, 2, 3, and 
4 polymorphic loci, respectively, which were suffixed 
with '_tT and '_t2', e.g. AHGS1991_t1 and 
AHGS1991_t2, to distinguish the loci each other. 
The 1261 polymorphic loci, 970 codominant, and 
291 dominant, were successfully mapped onto 20 
LGs covering 1442cM together with 208 loci from 
180 markers (Moretzsohn et al. unpublished data). 
The 20 LGs were classified into 1 0 pairs of A and B 
genomes, according to the DNA fragment size of 
mapped loci corresponding to 'K30076' or 'V1 41 67' 
(Fig. 1, Table 1, and Supplementary Table S1). The 
average marker density of the map was 1 .0 cM, and 
segregation distortions were observed in 19% (285 
loci) of the mapped loci (Table 1). 

In total, the number of non-redundant AHGS, AHS, 
and AhTE markers mapped on the three maps were 
890, 446, and 211, respectively. Out of the total 
1547 markers, 73 (67 AHGSs and six AHSs) and 
381 (298 AHGSs, 59 AHSs, and 24 AhTEs) were 
mapped on three and two maps, respectively, while 
1 093 (525 AHGSs, 381 AHSs, and 1 87 AhTEs) were 
mapped on just one map; the AF5, BF6, or TF6 map 
(Supplementary Table S1). Comparing the three 
maps, 203 of the 597 mapped loci on the AF5 map, 
and 1 57 out of 798 loci on the BF6 map, were also 
located on the TF6 map (Supplementary Table S1). 

The LGs were designated AA, BB, and TA orTB in the 
AF5, BF6 and TF6 maps, respectively. The nomencla- 
ture of the LGs of the diploid maps was tentatively 
determined according to the commonly mapped 
marker loci between the present and the previously 
constructed maps. 19,20 Eight LGs of the AF5 map 
(AA01 and AA03-AA09) and all 10 of the LGs of 
the BF6 map (BB01 -BB1 0) were assigned to the cor- 
responding LGs on the previous maps (Supplementary 
Table S2). The two LGs of the AF5 map were assigned 
to 'Group 02' in the previous map, 19 and they were 
tentatively named AA02 and AA1 0 (Table 1 and 
Supplementary Table S2). 'Group 1 0' in the previous 
map' 9 was disassembled on the AF5 map. On the 
other hand, in the TF6 map, TA01, TA03-TA09, and 
TB01 -TB1 0 were assigned to the corresponding LGs 
of the AF5 and BF6 maps, and TA02 and TA1 0 were 
assigned to the AA02 and AA1 0 LGs, respectively 
(Table 1 and Supplementary Table S2). After integrat- 
ing all of these data, 1 0 homoeologous groups (HGs) 
were identified on the TF6 map. 

The marker order was almost completely conserved 
between the HGs except for HG04, HG07, and HG08 
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TA01 TA04 BB04 TA07 




Figure 1. Genetic linkage maps obtained for AF5, BF6, and TF6 populations. The linkage groups in the AF5, BF6, and TF6 maps are 
indicated by AA, BB, and TA or TB, respectively, and the homoeologous groups are prefixed with HGs. Homologous or homoeologous 
marker loci are connected by lines. 
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(Fig. 1). In HG04, the lower part of TA04 corre- 
sponded to the upper part of TB04 (Fig. 1). In 
HG07, the loci mapped onto TB07 corresponded to 
those of TA07 and the upper part of TA08. 
Moreover, corresponding loci on TB08 were observed 
on the lower parts of TA08 and TA07 (Fig. 1), indicat- 
ing a translocation between the A and B genomes as 
previously suggested. 20,22,24 

3.2. Integration of the genetic linkage maps 
into a consensus map 
An integrated consensus map was constructed based 
on the segregation genotypes of 1 6 populations: three 
from this study and 13 from previous studies 
(Supplementary Table S2). 1 9 . 20 . 24 - 32 - 36 The integrated 
consensus map was 2651 cM in total length and com- 
prised 20 LGs, on which 3693 loci, including 1564 
AHGS, 569 AHS, 450 AhTE, and 1110 other loci, 
were mapped (Table 2, Fig. 2, and Supplementary 
Table S3), out of which markers mapped on mote 
than two loci were suffixed with '_cT and '_c2', e.g. 
AHGS1403_c1 and AHGS1403_c2, to distinguish 
each other. The average marker density of this map 



was 0.7 cM (Table 2). On the consensus map, 391 
marker loci were commonly mapped onto pairs of 
LGs in each HG, and the order of the markers was 
roughly conserved (Fig. 2). 

In the present study, locus clusters were defined as 
regions in which >10 loci were mapped in 5 cM 
windows. They were observed in all of the LGs 
(Fig. 2, Table 2, and Supplementary Table S3). 
The total length of the locus clusters was 505 cM 
(101 clusters), varying from 1 5 cM (three clusters) 
in A06 and B05 to 40 cM (eight clusters) in A03 
and B03. A total of 231 9 loci, representing 62.8% of 
the mapped loci, were located in 101 clusters, 
which varied in loci content from 53 (A08) to 175 
(AO 3) loci. 



3.3. Comparative mapping between the Arachis 

spp. and the four comparison legumes, C. cajan, 
G. max, L. japonicus, and M. truncatula 
Corresponding sequences were available for 3473 

of the 3693 mapped loci (Supplementary Table S3). 

Of these 3473 sequences, 869 showed significant 

similarity to one of the four legume genome 
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Table 2. Descriptions of the integrated consensus map and 
integrated LGs of the population-specific maps 



LGs 


Locus 
no. 


cM 


cM/ 
locus 


Length of the 
locus cluster 3 


Locus no. 
the mark 
cluster 


A01 


238 


1 83.8 


0.8 


25 


1 58 


B01 


168 


84.7 


0.5 


25 


1 29 


AO 2 


1 26 


78.6 


0.6 


20 


86 


B02 


1 54 


1 1 2.0 


0.7 


25 


1 04 


AO 3 


272 


1 51.6 


0.6 


40 


1 75 


B03 


234 


144.8 


0.6 


40 


1 67 


A04 


213 


1 06.8 


0.5 


20 


1 06 


B04 


208 


1 32.1 


0.6 


30 


1 22 


AO 5 


1 89 


1 26.0 


0.7 


30 


1 35 


B05 


1 69 


1 1 2.7 


0.7 


1 5 


77 


AO 6 


1 87 


1 36.4 


0.7 


1 5 


85 


B06 


1 76 


1 52.5 


0.9 


20 


109 


AO 7 


145 


1 67.0 


1 .2 


20 


76 


B07 


1 62 


1 79.1 


1.1 


25 


1 05 


AO 8 


1 67 


1 26.3 


0.8 


20 


53 


B08 


1 76 


1 07.3 


0.6 


30 


1 38 


AO 9 


1 94 


1 26.5 


0.7 


30 


141 


B09 


1 79 


1 70.5 


1.0 


30 


121 


A1 0 


1 72 


1 31.1 


0.8 


20 


1 1 7 


B1 0 


1 64 


1 21.3 


0.7 


25 


1 1 5 


Total 


3693 


2651.1 


0.7 


505 


231 9 



a Regions in which >1 0 loci were mapped in 5 cM windows. 



sequences (C. cajan, G. max, L. japonicus, and M. trun- 
catula). For each legume species, 51 5, 781 , 436, and 
419 sequences showed similarities to sequences in 
the C. cajan, G. max, L. japonicus, and M. truncatula 
genomes, respectively, and 1 70 sequences showed 
similarity to sequences in all four legume genomes. 

Segmental syntenic regions were observed between 
the Arachis HGsand the genomes of the four compari- 
son legumes (Fig. 3, Table 3, and Supplementary 
Table S3). Most of the LG pairs in each HG showed 
similar synteny relationships to those in the four com- 
parison legumes. Between the four comparison 
legume species, the genome of G. max showed the 
densest synteny of LGs, corresponding to those of 
Arachis. In addition, most of the LGs in Arachis were 
identified as shared syntenies with LGs on each of 
two chromosomes of G. max. This might reflect the 
palaeopolyploid nature of the genomes of these 
species. Comparing Arachis with the other three 
legume genomes, clearly syntenic segments were 
observed as follows: Arachis HG01 shared synteny 
with C. cajan chromosome 1 1 (Cc1 1) and M. trunca- 
tula chromosome 4 (A4t04); Arachis HG04 shared 
synteny with L. japonicus chromosome 4 (1/04); 



Arachis HG05 shared synteny with Cc06, i/02, and 
Mt05; Arachis HG06 shared synteny with L/01 and 
Mt07; Arachis HG07 shared synteny with Cc02, Lj02, 
and Mt05; Arachis HG08 shared synteny with Cc03; 
Arachis HG09 shared synteny with Cc02, L/05, and 
Mt01; and Arachis HG10 shared synteny with LjOI. 
On Arachis HG03, segmental blocks were identified 
that showed shared synteny between the comparison 
legume genomes, while no obvious synteny was 
observed between Arachis HG02 and the genomes 
of C. cajan, L. japonicus, and M. truncatula. 

4. Discussion 

4.1 . Polymorphism potential of the DNA markers 
mapped on the genetic linkage map 

The polymorphism potential of DNA markers in 
Arachis is usually evaluated on the basis of the poly- 
morphism information content (PIC) value. 34 While 
the PIC values of the AHS were estimated from geno- 
typing data from 1 6 Arachis lines, 37 those of the AHGS 
and AhTE have not been investigated thoroughly. 36,38 
In the present study and in previous studies, 36 968 
(416 AHGS, 395 AHS, and 157 AhTE), 666 (392 
AHGS, 65 AHS, and 209 AhTE), 245 (197 AHGS, 1 0 
AHS, and 38 AhTE), 73 (72 AHGS and 1 AhTE), and 
3 (AHGS only) markers were mapped on single, 
double, triple, quadruple, and quintuple maps, 
respectively. Of these, the PIC values of the 395, 65, 
and 10 AHS markers were estimated to be, on 
average, 0.23, 0.28, and 0.36, respectively, from the 
results of Koilkonda et al. 37 A positive correlation 
was observed between the PIC values and the 
number of maps on which the AHS was mapped. 
Therefore, if this estimation can be generalized, the 
AHGS and AhTE markers mapped on multiple maps 
can be considered as potential sources for obtaining 
polymorphic markers in other mapping populations. 

In the screening of polymorphic markers between 
the parental lines of the mapping populations, 
582, 862, and 1 144 markers were selected as poly- 
morphism candidates, but 1 87, 238, and 89 were 
excluded from the mapping analysis. The excluded 
markers might amplify DNAs from not the target 
locus but non-allelic repetitive and putative paralo- 
gous sequences in the Arachis genome, 38,45,46 which 
are not suitable for segregation and linkage analysis. 

4.2. Genetic linkage maps for three mapping 
populations 

Comparison of the AF5, BF6, and TF6 maps revealed 
that the marker order was almost perfectly conserved 
between the HGs, with the exception of HG04, HG07, 
and HG08 (Fig. 1 and Supplementary Fig. S1). This 
result indicated that the A and B genomes of Arachis 
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Figure 2. Comparison of the integrated consensus tetraploid Arachis map with diploid maps. The abbreviations AA and BB indicate LGs on 
the AF5 and BF6 maps, respectively, and the prefixes A and B show LGs on the integrated consensus map. Horizontal lines on the LGs 
indicate the positions of the mapped loci; the loci derived from one, two, three, four, five, and more than five genetic linkage maps are 
shown in black, blue, light blue, green, yellow, and red. Vertical bars on the left side of the LGs indicate the locus clusters. Homologous 
and homoeologous loci are connected by black lines. 



evolved from a common ancestor species. Exceptions 
to this shared marker order were found within 
HG04, and between HG07 and HG08. The former 
was observed for the first time in this study, while 
the latter confirms previous observations. 20,24 These 
findings suggest that the chromosome translocations 
might have occurred at, or after, the divergence of 
the A and B genomes in Arachis. 

The AF5 and BF6 linkage maps have been improved 
by subsequent mapping of the newly developed 
markers after they were originally published. 19,20 
However, despite the increase in the number of 
mapped loci from 1 70 to 597 in the AF5 map and 
from 149 to 798 in the BF6 map, the total map 
length decreased from 1231 to 544 cM in the AF5 



map and from 1294 to 461 cM in the BF6 map. In 
the maps generated in the present study, the marker 
loci that mapped to the ends of LGs in previous 
studies 19,20 were excluded, and the intervals 
between the markers were shorter than those in pre- 
vious maps. Mapping telomere-based makers will 
clarify whether the maps from this study have been 
saturated or not. 

In the linkage map for the TF6 population, the total 
length of the map was 1442 cM, which was shorter 
than that of the maps reported for Arachis tetraploids 
(221 OcM with 3 70 loci; 23 21 66 cM with 1114 
loci; 36 1 844 cM with 298 loci; 24 1 785 cM with 191 
loci 29 ). Nagy et al. 21 reported that the length of the 
genetic linkage map in A. duranensis ranged from 
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A Arachis vs. C. cajan B Arachis vs. 6. max 




C Arachis vs. L japonicus D Arachis vs. M. truncatula 




Figure 3. Comparative maps of Arachis and four other legumes, C. cajan, C. max, L. japonicus, and M. truncatula. Circled bars in brown, 
magenta, green, yellow, and blue indicate the genomes of Arachis, C. cajan (A), G. max (B), L. japonicus (C), and M. truncatula (D), 
respectively. Homologous loci are connected by lines. 



Table 3. Similarity of other legume chromosomes with that of 
Arachis homoeologous groups 



Arachis 


C. cajan 


G. max 


L. 

japonicus 


M. 

truncatula 


HG01 a 


1 1 


5, 7 


1 3, 1 7 


4, 6 


4 


HC02 




3, 7 








HC03 


4, 5, 8, 9, 
10 


4, 5 
1 2, 


6, 8, 9, 1 1, 
1 8 


3, 4 


4, 8 


HC04 


7, 8 


3, 7 
1 5, 


8, 9, 1 2, 1 3, 
1 8, 20 


6 


2, 7 


HC05 


6 


1,9 


11,16 


2 


5 


HC06 


2, 3, 7 


2, 3 


16, 19 


1 


7 


HC07 


2 


2, 6 


1 3 


2 


5 


HG08 


3 


4, 6 


8, 1 2, 1 3 


1, 3 


2, 3 


HC09 


2 


7, 1 0, 20 


5 


1 


HG1 0 


1, 8 


10, 


1 5, 1 9 


1 


2, 3 



a HG indicates homologous group. 



1081.1 to 2056.5 cM (depending on the mapping 
program used) when 1 054 segregation loci were 
used for the linkage analysis. Therefore, the different 
programs, algorithms, and functions used for 
mapping may explain the differences in map length 
reported by the present and previous studies. Our pre- 
liminary analysis, in which MapMaker program 47 gave 
longer map lengths than the JoinMap, also supported 
these explanations. Alternatively, as Sim et al. 48 sug- 
gested (based on linkage analysis in tomato), different 
ratios between the number of markers and popula- 
tion size in each population might have resulted in 
the length differences. 

Segregation distortions were observed in the three 
maps, the AF5, BF6, and TF6, and the ratios were dif- 
ferent among the populations and LGs (Table 1). 
Similar observations were found in our previous 
studies on tomato on which high and low ratios of 
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segregation distortion were observed in inter- and 
intraspecific maps, respectively. 49,50 Segregation dis- 
tortion is generally caused by the chromosomal struc- 
tural differences or the presence of transmission 
ration distorter factors on some chromosome. 51 Our 
findings will contribute to identify factors for the 
segregation distortion. 

4.3. Characterization of the cultivated peanut genome 
using the Arachis consensus map 

The consensus map was constructed using segrega- 
tion data from 1 6 populations, in which five species, 
A. duranensis (A), A. stenosperma (A), A. ipaensis (B), 
A. magna (B), and A. hypogaea (AB), were involved 
(Supplementary Table S2). A total of 889 loci 
derived from 421 markers were commonly mapped 
onto all of the paired LGs as HGs; however, the locus 
orders were not always conserved. The regions in 
which mapped loci showed non-corresponding 
orders between the paired LGs in each HG were 
considered as candidate regions having A or B 
genome-specific structures. Most of the locus orders 
were conserved between AA (diploid) and A (tetra- 
ploid) LGs, or between BB (diploid) and B (tetraploid) 
LGs (Fig. 2). However, several regions showed evidence 
of rearrangement, such as between BB03 and B03, 
BB09 and B09, and AA1 0 and A1 0. Arachis duranensis 
and A. ipaensis are considered to be the most probable 
ancestors of A. hypogaea, because of similar 
karyotypes of A. duranensis and A. ipaensis to that of 
A. hypogaea} 2 '^ 3 By contrast, even though the struc- 
tures of A genome of different species are well con- 
served, the A. stenosperma genome is distinguishable 
from that of A. duranensis on the basis of the variabil- 
ity observed in the heterochromatin and 18S-26S 
rRNA loci. 13 It is considered the possibility that such 
differences in the genome structure of the parental 
lines of the mapping populations might disrupt 
the marker order between diploid and tetraploid 
consensus maps. 

The average marker density of the consensus map 
was 0.7cM/locus (Table 2). Because the genome 
size of A. hypogaea is estimated to be ~2.8Gb, 16 
the DNA markers were located at <760 kb intervals 
on average. In other words, a map-based cloning strat- 
egy has become a realistic approach for molecular 
genetics in Arachis spp., because a candidate region 
for the location of a target gene can, on average, 
be identified to within 760 kb. Furthermore, this 
high-density map should aid in genome sequencing 
analysis in Arachis spp., because the locus order 
can work as a reference for the orientation of 
sequence contigs and scaffolds. Moreover, the marker 
clusters were observed in all of the LGs (Fig. 2 and 
Supplementary Table S3). Because chromosomal 



crossover, and the resulting genetic recombination, 
are severely suppressed in heterochromatin, 6,49 the 
clustered regions were speculated to be located in het- 
erochromatin, which makes up 1 0-20% of the Arachis 
chromosomes in somatic metaphase. 1 3,1 5 In the case 
of soybean, broad pericentromeric regions show 
markedly diminished recombination (4.2 Mb/cM), 
while the euchromatic regions at the chromosome 
ends maintain consistent levels of recombination 
(193kb/cM). 6 This pattern is also observed in 
tomato, with 6042 and 1 72 kb/cMin heterochromat- 
ic and euchromatic regions, respectively. 49 As with 
soybean and tomato, it was predicted that the physical 
marker-to-marker distance intervals in the peanut 
genome were different between heterochromatic and 
euchromatic regions. 

4.4. Comparative genomics of Arachis spp. with 

C. cajan, G. max, L. japonicus, and M. truncatula 

All of the Arachis HGs showed similarities to the 
genomes of the four legumes (Fig. 3, Table 3, and 
Supplementary Table S2). The genetic composition 
of Arachis corresponded to that of the C. cajan, 
L. japonicus, and M. truncatula genomes on a one- 
on-one basis, but was doubled when compared with 
the C. max genome. These observations support the 
estimation that a whole-genome duplication occurred 
in legumes ~58 million years ago (mya), at the time 
of the Papilionoid origin, 52 whereas the soybean-spe- 
cific duplication is estimated to have occurred <13 
mya. 6 These insights are based on a comparison of 
the A genome species between Arachis and L. japoni- 
cus and M. truncatula? 9 

At the chromosome level, each chromosome of 
Arachis, which belongs to the Dalbergioids, might 
have evolved differently and show different rearrange- 
ments from those in the ancestral chromosomes of 
the legumes after isolation from the other subfam- 
ilies, the Phaseoloids and Galegoids, ~55 mya. 52 Of 
the 10 Arachis HGs, HG01, 05, 07 and 09 showed 
microsyntenies with the chromosomes of C. cajan, 
L. japonicus, and A/1, truncatula. This suggested the pos- 
sibility that the major chromosome rearrangements 
did not occur after the time of divergence of subfam- 
ilies in the Papilionoideae. By contrast, HG02 showed 
the most disparate syntenies with the genomes of 
C. cajan, G. max, L. japonicus, and M. truncatula 
(Table 3), which suggested frequent chromosome 
rearrangement. 

5. Conclusion 

In this study, we developed three linkage maps for 
Arachis spp., and anchored the LGs to the A and B 
genomes. Integration of the tetraploid maps 
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developed in this study with the 13 previously pub- 
lished studies generated a high-density consensus 
map of tetraploid Arachis. The developed maps identi- 
fied structural features within the Arachis genome 
using comparative genomic analysis, and also identi- 
fied differences between the Arachis genome and 
that of other legumes, C. cajan, G. max, L. japonicus, 
and M. truncatula. The results obtained in this study 
will bridge the gaps in our knowledge regarding the 
genomes of Arachis and other legumes, and will 
further the genetic/genomic study and molecular 
breeding of Arachis. 

5.1. Availability 

Information for the genetic linkage maps and the 
integrated consensus map is available at http:// 
marker.kazusa.or.jp. 
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